LIMSI ICD10 Coding Experiments on CépiDc Death Certificate Statements
Abstract
We describe LIMSI experiments in ICD10 coding of death certificate statements with the CépiDc dataset of the CLEF eHealth 2016 Track 2. We tested a classifier with human-interpretable output, based on IR-style ranking of candidate ICD10 diagnoses. A tf.idf-weighted bag-of-features vector was built for each code in the training set by merging all the statements found for this code in the training data. Given a new statement, we ranked candidate codes by cosine similarity. Features included meta-information and n-grams of normalized tokens. We also prepared an ICD chapter classifier with the same method and used it to rerank the top-k codes (k=2) returned by the code classifier. For development we focused on mono-code statements and obtained a P@1 of 0.749, which chapter reranking increased to 0.778. On the test data we returned one code for each statement, leaving multiple code assignment for future work, and obtained a precision, recall, and F-measure of 0.7650, 0.5686, and 0.6524, respectively.
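The ranking step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the whitespace tokenizer, the unigram and bigram features, the smoothed idf formula, and the toy training statements are all assumptions made for illustration, and meta-information features and chapter reranking are left out.

```python
import math
from collections import Counter, defaultdict


def features(statement):
    """Lowercased word unigrams and bigrams (an assumed stand-in for
    the normalized n-gram features mentioned in the abstract)."""
    tokens = statement.lower().split()
    bigrams = [" ".join(pair) for pair in zip(tokens, tokens[1:])]
    return tokens + bigrams


def build_code_vectors(training):
    """Merge all statements of each code into one pseudo-document
    and weight its feature counts with tf.idf."""
    tf = defaultdict(Counter)
    for statement, code in training:
        tf[code].update(features(statement))
    n_codes = len(tf)
    # Document frequency: number of codes whose merged text has the feature.
    df = Counter(f for counts in tf.values() for f in counts)
    # Assumption: log idf with +1 so shared features keep a nonzero weight.
    idf = {f: math.log(n_codes / df[f]) + 1.0 for f in df}
    vectors = {
        code: {f: c * idf[f] for f, c in counts.items()}
        for code, counts in tf.items()
    }
    return vectors, idf


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(f, 0.0) for f, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def rank_codes(statement, vectors, idf):
    """Return all known codes, best match first (ties broken by code)."""
    counts = Counter(f for f in features(statement) if f in idf)
    query = {f: c * idf[f] for f, c in counts.items()}
    scored = [(cosine(query, vec), code) for code, vec in vectors.items()]
    return [code for _, code in sorted(scored, key=lambda s: (-s[0], s[1]))]


# Toy training pairs invented for this example (not CépiDc data).
TRAINING = [
    ("infarctus du myocarde", "I219"),
    ("infarctus aigu du myocarde", "I219"),
    ("cancer du poumon", "C349"),
    ("pneumopathie", "J189"),
]

vectors, idf = build_code_vectors(TRAINING)
ranking = rank_codes("infarctus myocarde", vectors, idf)
print(ranking[0])
```

Because each code is represented by a single weighted feature vector, the features that contribute most to a match can be listed directly, which is one way to read the abstract's remark about interpretable output.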
Related resources
SIBM at CLEF eHealth Evaluation Lab 2017: Multilingual Information Extraction with CIM-IND
This paper presents SIBM’s participation in the Task 1: Multilingual Information Extraction ICD10 coding of the CLEF eHealth 2017 evaluation initiative which focuses on named entity recognition in French and English death certificates. We addressed the identification of relevant clinical entities within the International Classification of Diseases version 10 (ICD10) in the CépiDC and CDC datase...
ICD10 Coding of Death Certificates with the NCBO and SIFR Annotator(s) at CLEF eHealth 2017 Task 1
The SIFR BioPortal is an open platform to host French biomedical ontologies and terminologies based on the technology developed by the US National Center for Biomedical Ontology (NCBO). The portal facilitates the use and fostering of terminologies and ontologies by offering a set of services including semantic annotation. The SIFR Annotator (http://bioportal.lirmm.fr/annotator) is a publicly ac...
CLEF eHealth 2017 Multilingual Information Extraction task Overview: ICD10 Coding of Death Certificates in English and French
This paper reports on Task 1 of the 2017 CLEF eHealth evaluation lab, which extended the previous information extraction tasks of ShARe/CLEF eHealth evaluation labs. The task continued with coding of death certificates, as introduced in CLEF eHealth 2016. This large-scale classification task consisted of extracting causes of death as coded in the International Classification of Diseases, tenth re...
The Effect of Death Certificate Completion Errors on Coding of the Underlying Cause of Death at Shahid Mohammadi Hospital, Bandar Abbas
Introduction: Death information plays a critical role in the adjustment of health plans, and cause of death coding helps organize this information. The purpose of this study was to review the impact of errors in the completion of death certificates on coding of the underlying cause of death in Shahid Mohammadi hospital in Bandar Abbas. Methods: This descriptive cross-sectional study...
Reliability of cause of death coding: an international comparison / Concordancia en la codificación de causas de muerte: una comparación internacional / Confiabilidade de codificação das causas de óbito: uma comparação internacional
This study evaluates the agreement of nosologic coding of cardiovascular causes of death between a Chilean coder and one in the United States, in a stratified random sample of death certificates of persons aged ≥ 60, issued in 2008 in the Valparaíso and Metropolitan regions, Chile. All causes of death were converted to ICD10 codes in parallel by both coders. Concordance was analyzed with inter-...